Pattern Based Term Extraction Using ACABIT System

نویسندگان

  • Koichi Takeuchi
  • Kyo Kageura
  • Teruo Koyama
  • Béatrice Daille
  • Laurent Romary
چکیده

In this paper, we proposed pattern based term extraction model for Japanese applying ACABIT system developed for French. Proposed model evaluates termhood using morphological patterns of basic terms and term variants. After extracting term selections, ACABIT system filters non-terms out from the selections based on simple log likely hood evaluation. This approach would be suitable to Japanese term extraction because most of Japanese terms form compound nouns or simple phrasal patterns. After showing the morphological patterns for terms, we show experimental results. By comparing morphological patterns with foreign languages, we discuss morphological units in Japanese. Keyword Term extraction, Pattern based, Morphological patterns, Termhood 文法パターンに基づく用語抽出モデルの構築

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparative Analysis of Wavelet-based Feature Extraction for Intramuscular EMG Signal Decomposition

Background: Electromyographic (EMG) signal decomposition is the process by which an EMG signal is decomposed into its constituent motor unit potential trains (MUPTs). A major step in EMG decomposition is feature extraction in which each detected motor unit potential (MUP) is represented by a feature vector. As with any other pattern recognition system, feature extraction has a significant impac...

متن کامل

Neural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten

Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications that includes, reading aid for blind, bank cheques and conversion of any hand written document into structural text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...

متن کامل

A Real-Time Electroencephalography Classification in Emotion Assessment Based on Synthetic Statistical-Frequency Feature Extraction and Feature Selection

Purpose: To assess three main emotions (happy, sad and calm) by various classifiers, using appropriate feature extraction and feature selection. Materials and Methods: In this study a combination of Power Spectral Density and a series of statistical features are proposed as statistical-frequency features. Next, a feature selection method from pattern recognition (PR) Tools is presented to e...

متن کامل

Evaluating a multi-word term indexing system: method, implementation and report

This article presents the evaluation by experts of the efficiency of a platform performing automatic multi-terms indexing. This evaluation is divided into three parts: firstly the evaluation of controlled indexing, then free indexing and finally the relevance of the variants found during controlled indexing. We present first the data and the two modules submitted to this evaluation ACABIT and F...

متن کامل

Two realizations of a general feature extraction framework

A general feature extraction framework is proposed as an extension of conventional linear discriminant analysis. Two nonlinear feature extraction algorithms based on this framework are investigated. The 1rst is a kernel function feature extraction (KFFE) algorithm. A disturbance term is introduced to regularize the algorithm. Moreover, it is revealed that some existing nonlinear feature extract...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/0907.2452  شماره 

صفحات  -

تاریخ انتشار 2009